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ABSTRACT 



A method and apparatus for performing storage and 
retrieval in an information storage system is disclosed 
which uses the hashing technique. In order to prevent 
contamination of the storage medium by automatically 
expiring records, a garbage collection technique is used 
which removes all expired records in the neighborhood 
of a probe into the data storge system. More particu- 
larly, each probe for insertion, retrieval or deletion of a 
record is an' occasion to search the entire chain of re- 
cords found for expired records and then removing 
them and closing the chain. This garbage collection 
automatically removes expired record contamination in 
the vicinity of the probe, thereby automatically decon- 
taminating the storage space. Because no long term 
contamination can build up in the present system, it is 
useful for large data bases which are heavily used and 
which require the fast access provided by hashing. 

8 Claims, 6 Drawing Sheets 
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searched to locate desired records. With the passage of 

METHODS AND APPARATU S FO R time, such storage contamination can reduce the perfor- 

INFORMATION STORAGE AND RETRIEVAL mance of retrieval operations below acceptable levels. 

UTILIZING HASHING TECHNIQUES Problems of this type are discussed in considerable 

5 detail in Data Structures and Program Design, by R. L. 

This application is a continuation of application Scr. Knise, Prentice-Hall. Englewood Cliffs, N.J., 1984, pp. 

No. 07/151.639, fUed Feb. 2, 1988 now abandoned. 112-126, and Data Structures with Abstract Data Types 

TECHNICAI FiFi n PASCAL by D. F. Stubbs and N. W. Webre, 

TECHNICAL FIELD Brooks/Cole Publishing. Monterey, Calif., 1985, pp. 

This invention relates to information storage and 10 310-336. 

retrieval systems and, more particularly, to the use of In the prior art, such storage space contamination 

hashing techniques in such systems. was avoided by deletion procedures that eliminated 

BACKGROUND OF THE INVENTION ^""^^^ .^^ 

another record m the collision-resolution chain of re- 
Infonnatioo or data stored in a computer^controUed 13 cords and thus dose the chain without leaving any 
storage mechanism can be retrieved by searching for a deleted records. One such procedure is shown in the 
particular key in the stored records. The stored record aforementioned text by Knuth at page 527. Unfortu- 
with a key matching the search key is then retrieved. nately, such non-contaminating procedures, due to the 
Such searching techniques require repeated accesses or necessity for successive probes into the storage ^lace, 
probes into the storage mechanism to perform key com- 20 take so much time that they can be used only when the 
parisons. In large storage and retrieval systems, such data base is off line and hence not available for access- 
searching, even if augmented by efTicient search algo- ing. 

rithms such as a binary search, often requires an exces- The problem, then, is to provide the speed of access 
sivc amount of time. of hashing techniques for large and heavily used infor- 

Another well-known and much faster method for 25 mation storage systems having expiring data and, at the 
storing and retrieving information from computer store same time, prevent the large-scale contamination which 
involves the use of so-called "hashing" techniques. normally results from expired records in such large and 
These techniques are also sonaetimes called scatter-stor- heavily used systems, 
age or key-transformation techniques. In a system using 

hashing, the key is operated upon (by a hashing func- 30 SUMMARY OF THE INVENTION 

tion) to produce a storage address in the storage space In accordance with the illustrative embodiment of the 
(called the hash toble). This storage address is then used invention, these and other problems are overcome by 
to access the desired storage location directly with using a garbage collection procedure '*on the fly" while 
fewer storage accesses or probes than sequential or other types of access to the storage space are taking 
binary searches. Hashing techniques arc described in 35 place. In particular, during normal dau insertion or 
the classic text by D. Knuth entitled The Art of Com- retrieval probes into the date store, the expired, obsolete 
puter Programming, Volume 3. Sorting and Searching, records arc identified and removed in the neighborhood 
pp. 506-549, Addison- Wesley. Reading. Mass., 1973. of the probe. Specifically, expired or obsolete records in 
Hashing functions are designed to translate the uni- the collision-resolution chain including the record to be 
verse of keys into addresses uniformly distributed 40 accessed are removed as part of the normal retrieval 
throughout the bash uble. Typical hashing operations procedure. 

include truncation, folding, Uansposition and modulo This incremental garbage collection technique has 
arithmetic. A disadvantage of hashing techniques is that the decided advanuge of automatically eliminating 
more than one key can translate into the same storage contamination caused by obsolete or expired records 
addre^ causing "collisions" in storage or retrieval 45 without requiring that the daU base be taken off-line for 
operations. Some form of collision-resolution strategy such garbage collection. This is particularly important 
(sometimes called "rehashing") must therefore be pro- for daU bases requiring rapid access and continuous 
vided. For example, the simple strategy of searching availability to the user population, 
forward from the initial storage address to the first 

empty storage location will resolve the collision. This 50 BRIEF DESCRIPTION OF THE DRAWING 
latter technique is called linear probing. If the hash table A complete understanding of the present invention 
is considered to be circular so that addresses beyond the may be gained by ocMisidering the following detailed 
end ofthe table map back to the beginning of the table, description in conjunction with the accompanying 
then the linear probing is done with "open addressing,^ drawing, in which: 

Le., with the entire hash table as overflow space in the 55 FIG. 1 shows a general Mock diagram of a con^ter 
evait that a collision occurs. system hardware arrangement in which the information 

Some forms of dau records have a limited lifetime storage and retrieval system of the present invention 
after which they become obsolete. Scheduling activi- might be implemented; 

ties, for example* involves records which become obso- FIG, 2 shows a general block diagram of a computer 
Icte alter the scheduled activity has occurred. Such 60 system software arrangement in which the information 
feoord storage locations cannot be simply emptied since storage and retrieval system of the present invention 
diis location nuy be a link in a chain of locations previ- might find os^ 

oi^y created during a collision-resolution procedure. HO. 3 shows a general How chart for table searching 
The classic solution to this problem is to mark the re- operaUon which might be used in a hashed storage 

" "^'gtcd" than as "empty," and to leave 65 system in accordance with the present invention; 

the record in place. In time, however, the storage space FIG. 4 shows a general flow chart for a garbage 
can become contaminated by an excessive number of collecting remove procedure which forms part of the 
deleted or obsolete storage locations that must be table searching operation of FIG. 3; 
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FIG. 5 shows a general flow chart for record inser- 
tion operations which might be used in a hashed storage 
system in accordance with the present invention; 

FIG. 6 shows a general flow chart for a record re- 
trieval <^>eration for use in a hashed storage system in 
accordance with the present invention; and 

FIG, 7 shows a general flow chart for a record dele- 
tion operation which might be used in the hashed stor- 
age system in accordance with the present invention. 

To facflitate reader understanding, identical refer- 
ence numerals are used to designate elements common 
to the figures. 

DETAILED DESCRIPTION 

Referring more particularly to FIG. 1 of the draw- 
ings, there b shown a general block diagram of a com- 
puter hardware system comprising a Central Processing 
Unit (CPU) 10 and a Random Access Memory (RAM) 
unit 11. Ccnnputer programs stored in the RAM 11 are 
accessed by CPU 10 and executed, one instruction at a 
time, by CPU 10. Data, stored in other portions of 
RAM 11, are operated upon by the program instruc- 
tions accessed by CPU 10 from RAM 11, all in accor- 
dance with well-known data processing techniques. 

Central Processing Unit (CPU) 10 also controls and 
accesses a disk controller unit U which, in turn, ac- 
cesses digital data stored on one or more disk storage 
units such as disk storage unit 13. In normal operation, 
programs and data are stored on disk storage unit 13 
imtil required by CPU 10. At this time, such programs 
and data are retrieved from disk storage unit 13 in 
blocks and stored in RAM 11 for rapid access. 

Central Processing Unit (CPU) 10 also controls an 
Input-Output (lO) controller 14 which, in turn, pro- 
vides access to a plurality of input devices such as CRT 
(cathode ray tube) terminal 15, as well as a plurality of 
output devices such as printer 16. Terminal 15 provides 
a mechanism for a computer operator to introduce in- 
structions and conunands into the computer system of 



number of utility programs 22 of general use to the 
computer user. Utilities 22 might, for example, comprise 
assemblers and compilers, nuthematical routines, basic 
file handling routines and system maintenance facilities. 
5 Many computer software systeoks also include a data 
base manager program 23 which controls access to the 
data records in a data base 24. Data base 24 nuy, for 
example, reside on a disk storage unit or units such as 
disk storage unit 13 of FIG. 1. User application pro- 
10 grams such as application program 25 tbien tise the dau 
base manager program 23 to access data base records in 
data base 24 for adding, deleting and modifying dau 
records. It is the efficient realization of a data base man- 
ager such as data base nunager program 23 in FIG. 2 to 
15 which the present invention is directed. 

Before proceeding to a description of one embodi- 
ment of the present inventioo, it is first useful to discuss 
hashing techniques in general. Hashing techniques have 
been used classically for very hst access to static, short 
20 term data such as a compiler symbol table. Typically, in 
such storage tables, ddetions are infrequent and the 
need for the storage table disappears quickly. 

In some common types of data storage systems, data 
records become obsolete merely by the passage of time 
2S or by the occurrence of some event. If such ex|Hred, 
lapsed or obsolete records are not removed from the 
storage table, they wUl, in time, seriously degrade or 
contaminate the performance of the retrieval system. 
Contamination arises because of the ever-increasing 
30 need to search longer and longer chains of record loca- 
tions, many of which are expired, to reach a desired 
location. 

More particularly, a hash table can be described as a 
logically contiguous, circular list of consecutively num- 
35 bered, fixed-sized storage units, called cells, each capa- 
ble of storing a single item called a record. Each record 
contains a distinguishing field, called the key, which is 
used as the basis for storing and retrieving the associ- 
ated record. The keys throughout the hash table data 
FIG. 1, and may be supplemented with other input 40 base are distinct and uniqtie for each record. Hashing 



devices such as card and tape readers, remotely located 
terminals, optical readers and other types of input de- 
vices. Similarly, printer 16 provides a mechanism for 
displaying the results of the operation of the computer 
system of RG. 1 for the computer user. Printer 16 may 
similarly be supplemented by line printers, cathode ray 
tube displays, phototypesetters, graphical plotters and 
other types of output devices. 
The constituents of the computer system of FIG. 1 



functions which associate keys with storage addresses 
are usually not one-to-one in that they map many dis- 
tinct keys into the same location. 
To store a new record, a cell number is generated by 
45 invoking the hashing function on the key for the new 
record. If this cell location is not occupied, the new 
record is stored there. If this cell location is occupied, a 
collision has occurred and the new record must be 
stored elsewhere, in an overflow area, using an appro- 



and their cooperative operation are well-known in the 50 priate oollision-resoluticHi technique. A common colli- 



art and are typical of all computer systems, from small 
persona] computers to large main frame systems. The 
architecture and operation of such systems are well- 
known and, mnce they form no part of the present in- 
veoticm, will not be further described here. 

In FIG. 2 there is shown a graphical representation of 
a typical software architecture for a computer system 
Mich as that shown in FIG. 1. The software of FIG. 2 
txmpriaa an access mechanism 20 which, for simple 
personal computers, may comprise no more than turn- 
ing the system on. In larger systems, |»oviding service 
to a larger number of users, login and password proce- 
dures would typically be implemented in access mecha- 
nism 20. Once access mechanism 20 has completed the 
ktgin procedure, the user is placed in the operating 
system environment 21. Operating system 21 coordi- 
nates the activities of all of the hardware components of 
the computer system (shown in FIG. 1) and provides a 



55 



60 



65 



sion-resolution strategy, which wUl be described here, is 
known as linear probing under open addressing. Open 
addressing means that the overflow area is the entire 
hash table itself. Linear protog indicates tequential 
scanning of cells beginning with the next cell, recalling 
that the storage table is viewed circularly. The collision 
is resdved by storing the record in the first unoccupied 
cell found. 

To retrieve a record, the key is hashed to generate a 
cell location. If the record is not there (the keys do not 
match), searching continues following the same for- 
ward path as record storage. An empty cell terminates 
the retrieval procedure, which has then failed to find 
the record to be retrieved. 

In FIG. 3 there is shown a flowchart of a search table 
procedure for searching the hash table preparatory to 
inserting, retrieving or deleting a record. The hash table 
may, for example, comprise the data base 24 of RG. 2 
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and the search table procedure of FIG. 3 comprise a record to be removed in a forward direction searching 
portion of the data base manager 23 of FIG. 2. Starting for a record whose key hashes at or behind the ceU to be 
in box 30 of the search table procedure of FIG. 3, the removed. When such a record is found, it is copied to 
search key of the record being searched for is hashed in the cell of the record to be removed. The copied record 
box 31 to provkie the address of a cell. In box 32, the 5 is then taken as tfic record to be removed and the pro- 
empty cell just past the end of the search chain of non- cess is continued untU the end of the search chain is 
empty cells is located, i.e., the first succeeding unoccu- reached. In box 54, the final copied record is marked 
pied ccU is found. In box 33. the procedure moves one empty prior to teminating the procedure. The remove 
cell backward from the current cell position (now at the procedure of FIG. 4 might comprise a portion of the 
end of the Cham). Decision box 34 examines the ceU to 10 data base manager program 23 of FIG- 2. 
determine whether the cell is empty ornot. If the cell Starting at starting box $0 of FIG. 4, the procedure is 
tcstedm decision box 34 h empty, decision box 35 is entered with the location of a cell to be removed which 
entered to dctcrmmc if a key maU:h >*«s previously ^ called the base ccU. InitiaUy, box 51 is entered where 
found m decision box 41 (as wiU be described Wow). If ^ount in the table is adjusted to reflect the 

*o. the search is successful and rctuiro suco^s m box 36 15 .^moval of one record. The load, of course, is the num. 
J^i^^I^T^'"^ )?K ^ ^[ """Si**?* r ^ o^^P^ <^^^ As previouriy noted, the value of 

°T ^l^^^f" this load cii be used to disable L insertion of new 

l^A w """^ ^r*^^ until the load has reached a low enough value 

smce an empty ceU was found before a cell with a ^ ^^^^^^ searching. In box 52. theproc^ 

^tchmg key. The procedure again termmates m box 20 ^^^^ ^ ^^^^ ^ ^ L^^^Z 

if the cell tested in decision box 34 is not empty. J' /^^^^ ^^r^ 

decision box 40 is entered to determine if the record in ^.l*'^?^ "^^'^^ '^^^ *'^'?'k'T ^ 
that cell has expired. This is determined by comparing "^^ed and box 54 is entered to mark the base cell as 
some portion of the contents of the record to^ome 25 ''"P'y- I^^on ^^ 55 is then entered to^^ 
external condition. A timestamp in the record, for ex- was fomid (by the search table procedure) which 

ample, could be compared with the timeK>f-day. Alter- ««ched the search key and if so, the procedure is 
natively, the occurrence of an event can be compared terminated m termmal box 56. If a matchmg record was 
with a field identifying that event in the record. In any « entered to determine if the 

event, if the record has not expired, decision box 41 is 30 ^ " ****** location of the search key. 

entered to determine if the key in this record matches procedure is terminated in box 56. If the base 

the search key. If it does, the cell location is saved in the search record, then the base 

box 42 and the procedure returns to box 33. If the re- cell can be used for storing a new record. In box 58. the 
cord key does not match the search key, the procedure location of this empty cell is therefore saved as a possi- 
retums directly to box 33. 35 insertion rite. 

If decision box 40 determines that the record has Returning to box 53, if the next cell is not empty, box 
expired, box 43 is entered to perform a non<x)ntaminat- ^ entered to determine if the record in this cell 
ing deletion of the expired record, as will be described hashes ahead of the base cell. If so, box 52 is re-entered 
in connection with FIG. 4. In general, the procedure of to advance to the next cell in the chain. If this next cell 
box 43 (FIG. 4) operates to move a record further 40 hashes at or behind the base cell, however, box 60 is 
toward the end of the chain into the position of the entered to copy the contenU of this next cell to the base 
record which has expired, thereby removing the ex- cell, thereby obliterating (removing) the base cell oon- 
pired record and. at the same time, closing the search tents. Box 61 is then entered to test if the search table 
chain. procedure found a matching record. If not, box 52 is 

It can be seen that the search table procedure of FIG. 45 re-entered to advance to the next cell. If a matching 
3 operates to examine the entire chain of records of record was found, decision box 62 is entered to test if 
which the searched-for record is a part, and to delete the matching record is the base cell record. If not, box 
expired records by chain-fdling rather than by marking 32 is re-entered to advance to the next cell. If the match- 
such records as deleted. In this way, contamination of ^$ record is the base cell, however, box 63 is entered to 
the storage space by expired records is removed in the 50 store the location of the former base cell as the position 
vicinity of each new table search. If contamination of the matching record and then box 52 is re<«atered to 
becomes too large even with such automatic garbage advance to the next ceO in the search chain, 
oc^ection, then the insertion of new records can be It can be seen that the procedure of FIG. 4 operates 
inhibited until the search table procedure has had a to examine the entire search chain and to move records 
chance to remove a sufTident number of expired re- 53 from later poations in the chain to vacated positions in 
cords to render the operation of the system sufficiently the chain such that the chain is entirely closed at the end 
efficient of the procedure. That is, no empty cells are left to 

The search table procedure illustrated generally in erroneously break up a search chain. As noted in oon- 
FIG. 3 is implemented in the Appendix as PASCAL- necticm with FIG. 3, expired records are subjected to 
lake pseud o code. Source code suitable for compilation 60 the remove procedure of FIG. 4. As wiU be noted in 
and execution on any itandard hardware and software connection with FIG. 7, records to be deleted ftorn the 
computing system can readily be devised from this data base are also subjected to the /vmow procedure of 
pseudocode and the flowcharts of the figures by any FIG. 4. 

perKm of ordinary skill in the art. The remove procedure illustrated generally in FIG. 4 

In FIG. 4 there is shown a flowchart of a remove 65 is implemented in the Appendix as PASCAL*like 
procedure which removes records from the database. pseudocode. Source code suitable for compilation and 
either records to be deleted or expired records. In gen- execution on any standard hardware and software com- 
eral. this is accomplished by traversing the chain of the puting system can readily be devised from this pseudo- 
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code and the flowchart of FIG. 4 by any person of with a matching key. If not, box 103 is entered to report 

ordinary skUl in the art. failure of the deletion procedure and the procedure is 

In FIG. 5 there is shown a detailed flowchart of an terminated in box 106. If a matching record was found, 

insert procedure suiuble for use in the information stor- as determined by box 102, the remove procedure of FIG. 

age and retrieval system of the present invention. The 3 4 is invoked in box 104. As noted in connection with 

insert procedure of FIG. 5 begins as starting box 70 FIG. 4, this procedure removes the record to be deleted 

from which box 71 is entered. In box 71, the search table and, at the same time, closes the search chain. Box 105 

procedure of FIG. 3 is invoked with the search key of is then entered to report successful deletion to the call- 

the record to be inserted. As noted in connection with ing program and the procedure is terminated in box 106. 

HG. 3, the search table procedure locates the target cell 10 TIk delete procedure illustrated generally in FIG. 7 is 

location and, if part of a search chain, removes all ex- implemented in the Appendix as PASCAL-like pseudo- 

pired cells from that search chain. Decision box 72 is code. Source code suitable for compilation and execu- 

tben entered where it is determined whether or not the tion on any standard hardware and software computing 

search table procedure found a record with a matching system can readily be devised from this pseudocode and 

key. If so, box 73 is entered where the record to be 13 the flowchart of FIG. 7 by any person of ordinary skill 

inserted in put into the storage table in the position of in the art 

the old record with a matching key. In box 74, the insert The attached Appendix contains pseudocode listings 

procedure reports that the old record has been replaced for all of the programmed functions necessary to imple- 

by the new record and the procedure is terminated in ment a data base manager 23 (FIG. 2) operating in ac- 

tenmnal box 75. 20 cordance with the present inventkm. These listings 

Returning to decision box 72, if a matching record is follow the flowcharts of FIGS. 3-7 and further explain 

not found, decision box 76 is entered to determine if the and elucidate the flowcharts. Any person of ordinary 

table load is below a preselected threshold (typically skill in the art will have no difficulty implementing 

about 73% of the table capacity). If the load is not these functions in any desired program language to run 

below the threshold, the storage table is too full to be 25 on any desired computer hardware configuration, 

access efficiently, and box 77 is entered to report that It should also be clear to those skilled in the art that 

the the table is full and the record caxuot be inserted. further embodiments of the present invention may be 

The procedure then terminates in terminal box 75. If the made by those skilled in the art without departing from 

load is below the threshold, box 78 is entered where the the teachings of the present invention, 
record to be inserted is placed in the empty cell position 30 

found by the search table procedure. In box 79. the load mwix 

is adjusted to reflect the addition of one record to the 

storage table, the procedure reports that the record was Ts&etioa» mnted 

inserted in box 80 and the procedure terminated in box 

7S. 33 ^ following functions •» wte «v*il«bl« to th* 

The insert procedure illustrated generally in FIG. 5 is •ppne.tion pn^ru: 
implemented in the Appendix as PASCAL-like pseudo- 

code. Source code suitable for compQation and cxccu- (jasaEi* tvpg ) 
tion on any standard hardware and software computing 
system can readily be devised from this pseudocode and 40 
the flowcharts of the FIG. 5 by any person of ordinary 

skill in the art. 

In FIG. 6 there is show a detailed flowchart of a «u not round in th* tabi* «nd tha pasMd rvcerd ms" 

retrieve procedure which is used to retrieve a record nbsoquuiuy inswtad. 
from the data base 24 of FIG. 2. Starting in box 90, the 45 

search table procedure is invoked in box 91, using the ^^mm tnH if » record mocutod with jxax^.^ssL w 
key of the record to be retrieved as the search key. In ^ p"*^ 
box 92 it is determined if a record with a matching key ^"«« '-^or h» tmcm w« ^.^tgr . 
was found by the search table procedure. If not, box 93 
is entered to report failure of the retrieve procedure and 30 
the procedure is terminated in box 96. If a matching ,j,ti»« »ucc». if r«»rd «edoeut.d vita roeord .k«v v» 
record was found, box 94 is entered to copy the match- foand in tho t«bi« snd «ui«n«d to record , 
ing record into a bufTer store for processing by the 
calling program, box 95 is entered to return an indica- 
tion of successful retrieval and the procedure tcrmi- 55 R,tani» f«iiur« if sMrch m nnMccwsfui. 
nated in box 96. 

The pseudocode for the retrieve procedure of HG. 6 tes£2£4^» v^^-, tm) 

is included in the Appendix. Executable code for all ««cut«i with r^ord it«v w.. 

common hardware and system software arrangements found in th« t«bio and BUbso^wntiy doiot^d 
can readily be devised by those skilled in the art from 60 

the flowchart and the pseudO-OOde. aatarns f>Uur« if aono toond. 

In FIG. 7 there b i^wn a detailed flowchart of a 

deleu procedure useful for actively removing records oof initiaoa 
firotn the dau base 24 of FIG. 2. Starting at box 100, the 

pfocedureofHG. 7 first invokes the «a/cA tote proce- 65 ^ toiic^ii^ fonai ddfinition. m rdquind for 

dure of FIG. 3 ih box 101, using the key of the record to "T^li^ ^ i~«tion, r.tri«mi, ^ d^tion 

be deleted as the search key. In box 102. it is determined « 

if the search table procedure was able to locate a record \^^)>f f ttf /• •!» of huh t«bi« •/ 



BAturns Mloeod if a rmzd nueeUtwl with r«cord . te«v 
vu found In tha tabX* and aubMquontly rvpXrcwI. 

in»«rfd if « rocexd osseciatod with racord . fcav 
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/• bub t«bltt V 



10 

/• Lut two arguBvnts to r«oov« m 

nort nlovut bttro «/ 



w load : 0 



Algorltbu 



/• mntoor of occupiod «ntrl«o of 
hmmh tablo omy (initUlly 0) •/ 



If •oareh fblo (flffOT^ W> e»i&4«l) 



10 



jageritbM for tb« fnnetiom da»erlb*a abovo sn , 
givon bolow: 

iw—ft r r»eerd : roeord tvpo ) t 

f ropl«eod . Iniorfd . XS&LI )' 

vur PoaltAon t 0 *• fblo »iio- li 

/* position in tablo to vpdAto or 

iMsrt (r«tam«l by f— rch %^blm) */ 

bogltt 

if March tabl> f rteord . icoY . Dooitten > 
tb*a boglB 

tabla f poiltion l s- rocord f 



IS 



20 



du—fv v>ri«bl<l : 
titnn foooew l 
ad 

•IsM ntvTB I f Alttr« > 
•nd /• dolot* •/ 



•Bd 



if load/f blo < MX load factor 



load loag+l; 
t»blo f po»ltiom i» record ; 
return OaStEtlfi) 



fnAOtioB — reh f bl> (r«?9F^ ^•V imcof^ ^ type; 

Tsr BQ»itioft : 0 tablo mitm- U i booUan; 

25 /• aaarch tabU for xmcor^ kay and dolota oxpirad 

•vpirod rocorda in targat ebalni poaition la aat to 
indax of found racovd or appropriate «Bpty call •/ 

w is 0 ubisjOss-i' 

/• uaod for acaiming chain, 
^ botb foxwazda « bac3warda */ 



V 0 ■• tabla a^a-l; 
/• indax of loftaoat aapty call 
to right of ooaltion •/ 

is rac found : boolaan : 

/• indicataa whatbar aaarcb is succaasful V 



35 



alas rotnn f full l 
and /* inaart •/ 

fUBOtioft ratrlafva (w racord : raccrd tvpa ) ; 
(auccaaa , fallura l ; 

w eaalSiSQ: 0 1?ft»^a alra-lt 

/• poaitien in tabla vhara racord 

raaidaa (ratumad by aaareh table ) «/ 



bagib 5Q 
if ffprch tabU (£SCSEd-2»£r BSUUSIl) 



40 



45 



record t« tabla fpo aition l ; 



f failure) 



delete (£S£ac&AaZ> XS^SSJEST-fiEBft) * 



S5 



fiO 



65 



Tar poaitloni O ff^W^^ 

/• pooition In ^la vbare record 

resUoa (returned by aaarcb tabla) •/ 



begin 

position itasg jyaeord ^^y) j 

if tabla r ooaition l it not aspty than 



begin 

i »- position I /• loop initialisation •/ 

ropaat /• acan forvard to end o£ chain 

containing tabla fooaitionl •/ 

X 1- a^l) nod table a^e 

vaUI ( table t Al ia anpty) ; 



1 <« a-^*^ii-ftUft) 



vhilo (5^)Lt(11 i* Obt napty) do 
/«Bean chain in ranrarse, 
delating expired antrioa •/ 

begin 

K tnbld til is axpirad 



a, rec t^ya, 
ooaltion . noa anptvl 

elne if table til .^SX " kvt 
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im rmc f ocnd f 



If aet i« rmc foond tb«n poaltlon t- pq» CTPtv 



r»c fowidl 



!0 



15 



w DO* Of «««reh r«e. 



' 0 •* t«bl« all**!: 



/« 0«l«t« fblTc*!! to d«n V 



25 



do Cor«v«r 



1 s- 9*11 ^ 

/• Mva pMitiea of «Bptl*d slot 



50 



35 



/« Bcaa fervmrd looking for » 

record to fill hoi* in chkiit */ 

cfll te { <Mll to d»H >ll wd f bl« •izm t 

if taiblo f c«ll to d«n is sKpty 

ttaSB bsgia 

£sbl£[i] im mmptYf 

it bot j^f r>9 geund th«a 



If (pos 9f f fSfcf^ rsc 1 i < pos y=f--v) 
« U< WtY < fff Mtryh r«e> 45 

(PO» ^FPtY< tK>s of ssarch rsc < 
1> than PCS srotv 



xstnn 
wdi 



50 



1 hash ttsblsfesll to dsl l.tovl 

ntil CI < i < e«ll to d«l> 
ox a < oil to dsl < 1> 
w feoll to dsK 1 < iW 



55 



!• tsblof e*!! to dsn I 
«M fblo f csll to doll to plug hols in auin •/ 



12 



What is cbumed is: 

1. An information storage and retrieval system using 



hashing techniques to provide rapid access to the re- 
cords of said system and utilizing a linear probing tech- 
nique to store records with the same hash address, at 
least some of said records automatically expiring, said 
system comprising 
a record search means utilizing a search key to access 

a chain of records having the same hash address, 
said record search means including means for identi- 
fying and removing all expired ones of said records 
from said chain of records each time said chain is 
accessed, and 
means, utilizing said record search means^ for insert- 
ing retrieving and deleting records from said sys- 
tem and, at the same time, removing all expired 
ones of said records in the accessed chains of re- 
I cords. 

2. The information storage and retrieval system ac- 
jcording to claim 1 further comprising 

20 means for recursively moving a record from a later ^ 
position in said chain of records into the position of ^ 
one of said expired records. 

3. The information storage and retrieval system ac- 
cording to chum 1 further including 

means for counting the number of records in said 
system, 

means, responsive to said counting means, for inhibit- 
ing the insertion of new records into said system 
when the number of records in said system exceeds 
a preselected value. 

4. The information storage and retrieval system ac- 
cording to claim 3 further including 

means, also responsive to said counting means, for 
re-enabling the insertion of new records into said 
system when the number of records in said system 
falls below said preselected value. 

5. A method for storing and retrieving information 
records using hashing techniques to provide rapid ac- 
cess to said records and utilizing a linear probing tech- 

40 nique to store records with the same hash address, at 
least some of said records automatically expiring, said 
method comprising the steps of 
accessing a chain or records having the same hash 
address, 

identifying the automatically expired ones x>f said 
recOTds, 

removing all aut<Hnatically expired records from said 
chain of records each time said chain is accessed, 
and 

inserting, retrieving or deleting one of said records 
from said system following said step of removing. 

6. The method according to claim S further compris- 
ing the step of 

moving a record from a Uter position in said chain of 
records into the pontion of one of said expired 
records. 

7. The method according to claim 5 further compris- 
ing the steps of 

counting the number of records in said system, and 
inhibiting the insertion of new records into said sys- 
tem when the number of records in said system 
rises above a preselected value. 

8. The method according to claim 7 further compris- 
ing the step of 

re-eiubling the inserticm of new records into said 
system when the number of records in said system 
falls below said preselected value. 



60 
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